Statistics of sums of correlated variables described by a matrix product ansatz 
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We determine the asymptotic distribution of the sum of correlated variables described by a matrix 
product ansatz with finite matrices, considering variables with finite variances. In cases when the 
correlation length is finite, the law of large numbers is obeyed, and the rescaled sum converges to 
a Gaussian distribution. In constrast, when correlation extends over system size, we observe either 
a breaking of the law of large numbers, with the onset of giant fiuctuations, or a generalization 
of the central limit theorem with a family of nonstandard limit distributions. The corresponding 
distributions are found as mixtures of delta functions for the generalized law of large numbers, and 
as mixtures of Gaussian distributions for the generalized central limit theorem. Connections with 
statistical physics models are emphasized. 
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The law of large numbers and the central limit the- 
orem are cornerstones of equilibrium statistical physics. 
Indeed, the very existence of determistic values of macro- 
scopic observables in large systems relies on the law of 
large numbers, while the Gaussian shape of tiny fluctua- 
tions around the mean value are described by the central 
limit theorem. The latter also bears strong connections 
with random walks [1| . Basic forms of these theorems are 
known for independent and identically distributed (i.i.d.) 
random variables IJ^^. Some generalizations with less 
restrictive assumptions are also known [3|, Ij], and it is 
often assumed that these theorems remain valid as long 
as distributions of individual variables are not too broad 
and do not differ too much one from the other, and as 
long as correlations are weak enough. These assumptions 
are however not always valid in nonequilibrium systems, 
where the relevant observables may have an infinite mean 
(e.g., in aging [5[ |6| or laser cooling [7| phenomena), 
or may have long-range correlations (e.g., in boundary 



driven [^, Q or active [Ifl, |ll| systems) , leading to the 
breakdown of the law of large numbers. Similarly, the 
standard central limit theorem breaks down in a num- 
ber of cases. For broadly distributed variables with infi- 
nite variance, the generalized central limit theorem yields 
Levy-stable laws 3] , with many applications often related 
to anomalous diffusion [5|]. Non-Gaussian distributions 
have also been found for instance in the context of 1//"- 
noise ;12'] and related problems 13J, ll4j, where summed 
variables have very different statistics. For strongly cor- 
related variables, generalizations of the central limit the- 
orem have been derived for Gaussian processes [15|, ll6J . 
However, in a statistical physics context, another class of 
strongly correlated variables, defined through a matrix 
product ansatz, has emerged from the exact solution of 
nonequilibrium models like the Asymmetric Simple Ex- 
clusion Process (ASEP) [13, [3- Although infinite matri- 
ces may be needed, notably in the context of the ASEP 
model \xA llSl, a significant number of models can be 
solved using finite matrices, including reaction-diffusion 



lattice models 18- 20ll. and the ASEP model for specific 
parameter values 9|,|21|. In spite of the increasing impor- 
tance of this class of random variables, the correspond- 
ing generalizations of the law of large numbers and of 
the central limit theorem are presently not known. In 
this Letter, we aim at providing such generalizations for 
variables described by a matrix product ansatz with fi- 
nite matrices. Our study encompasses both discrete ran- 
dom variables as in the ASEP model [17| and continuous 
ones as in^ signal processing [22, l23| or in mass transport 
models |2J] . For the sake of clarity, we restrict our pre- 
sentation to specific, yet representative cases, deferring a 
full-length account of our results to a forthcoming publi- 
cation j25l |. 

Matrix product representation- Following 
consider a set of random variables (xi, . . . ,xn) whose 
joint probability distribution can be described by a ma- 
trix product ansatz, namely 
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where 7l{x) is a. Dx D matrix function {D > 2) with real 
nonnegative entries, £ — J_ TZ{x) dx, and iZ is a linear 
form defined as 



C (M) = tr (A^M) 



(2) 



with A a given D x D, nonzero matrix with real non- 
negative entries. We further assume that for all iV > 1, 
C{£^) 7^ 0. Standard forms used in statistical physics for 
the linear form £ [18[ are recovered either by taking A as 
the identity matrix, or by choosing Aij = ViWj so that 
L{M) = (y|M |M^). Eq. dl]) is a natural generalization to 
correlated variables of the i.i.d. case, replacing the prod- 
uct of real functions by a product of matrix functions. 
It is useful to introduce the matrix of distributions 'P{x) 
through the relation 



l-Cij\Xj — ^i'j ' ij\-^)^ 



(3) 



so that Vij (x) can be interpreted as a probability distri- 
bution, normalized to 1. We consider probabilities 'Pij{x) 



with finite mean value rriij and finite variance afj . Note 
that 'Pij(x) is uniquely defined only when £ij ^ 0. 

Hidden Markov Chain representation- As shown in 



22l l23j , the joint probability ([T]) can be reinterpreted us- 
ing the concept of Hidden Markov Chain [26| . Expanding 
the matrix product, one can rewrite Eq. ([T]) as 



P{xi,...,xn) = '^ktPt{xi,...,xn) 



with r = (Fi, . . . , Fat-i-i) a chain of indices, and 
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Eq. (|4]) can be interpreted as a mixture of distributions 
Pr{xi,. . . ,xn) indexed by the chain of indices F, each 
with probability weight kt (note that X^r '^r = !)■ For 
a given F, the distribution Pr{xi, . . . ,xn) describes in- 
dependent, but nonidentically distributed random vari- 
ables. Correlations, when present, thus emerge from the 
mixture of the different distributions Pr ■ 

Interestingly, the probability weight Kp can be inter- 
preted as the probability of a Markov chain 2^, l23 |. 



More precisely, given a starting point Fi and a final point 
F^_i_i, Kr can be recast as the probability of an inhomo- 
geneous Markov chain with transition probabilities 



p(Ffe+i = j\Tk = i, Tn+1 = f) = £i: 
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Note that this Markov chain is of a nonstandard type, 
due to the dependence on the final state Fjv+i. Both 
the initial and final states thus need to be selected in 
advance, and their joint distribution is given by: 



p(Fi = i, Tn+1 = /) = Aif 
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Hence the procedure to simulate the correlated random 
variables described by Eq. (JU is the following [23]: (i) Fi 
and FAr_|_i are chosen at random according to distribution 
(jH]); (ii) the random chain F is obtained from transition 
rates ([7]); (iii) the random variables Xk {k — 1, . . . N) are 
drawn randomly from the distributions 7^rfcrfc+i(a;fc)- 

As seen in Eq. (O, the matrix £ plays a key role in 
statistical properties of the Markov chain F, which in turn 
determine correlations between the x^'s. It can be shown 
in particular (see [23| for examples) that £ determines 
the short-range or long-range nature of the correlations. 
If the eigenvalue Ai of £ with the largest modulus has a 
multiplicity of 1 , the correlation Cfcz = {xkXi) — {xk){xi) is 
quasi-stationary, short-ranged and decays exponentially 
to zero at large |fc — Z|. A prominent class of matrices 
obeying this property is the class of aperiodic irreducible 



matrices (see below), due to Perron- Frobenius theorem 
27| . In contrast, if the multiplicity of Ai is strictly larger 
than 1, the correlation C^i (or more precisely, the full 
dependence structure) is generically long-ranged. Two 
situations may arise in this case, according to the Jordan 
decomposition of the matrix £ [28j. If Ai is associated to 
a diagonal block, the correlation Cki decays to a constant 
(and generically nonzero) value for large |/c — ^|. If Ai is 
associated to a nondiagonal block, the correlation Cki is 
nonstationary, and becomes a function of k/N and IJN, 
so that correlations also extend over system size [9|, |23| . 
We come back below to these different classes of matrices. 
Statistics of the sum- We now focus on the study of 
the statistical properties of the sum S = X]fc=i ^fe> i^ ^^^ 
limit A^ — > oo . We are specifically interested in the valid- 
ity of the law of large numbers and of the central limit 
theorem. Introducing the variable s = S/N, the law of 
large numbers breaks down if the limit distribution \l/(s) 
for A — >■ oo does not reduce to a delta function. When 
the law of large numbers holds, namely ^(s) = S{s — m), 
one can investigate the fluctuations of S around its mean 
value Nm on a scale ^ VN, through the rescaled variable 



S~ Nm 



N 



(9) 



When correlations are weak enough, the central limit the- 
orem should hold, and the distribution ^n{z) should con- 
verge to a Gaussian when N ^- oo. 

The Hidden Markov Chain formalism proves very use- 
ful in order to study the statistical properties of the sum, 
as it allows to separate two different sources of random- 
ness: the random choice of the chain F, and the ran- 
dom choice of (xi, . . . , xn) from the distribution Pr, for 
a fixed F. Hence the distribution of S can be determined 
by first computing the distribution of Sr = X]fc=i^fcj 
where the Xk's are drawn from Pr, and then averaging 
the distribution of Sr over F. For a given chain F, we 
introduce the fraction i^ij of transitions from i to j in F, 

Vij = -TT cardjfc, F^ = i and Ffe+i = j} . (10) 
For a given F, the sum Sr can be rewritten as 
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1, . . . , Nvij, are i.i.d. ran- 



where the variables X 
dom variables drawn from the distribution Vij. Let us 
first investigate the validity of the law of large numbers 
for the sum S. For a fixed F, one can write 
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Given that, for fixed («,j), the variables Xj^-'' are i.i.d., 
the law of large numbers can be applied to the term be- 
tween brackets, which thus converges (almost surely) to 
the mean value mij. Hence, for a fixed v — {vij), the 
conditional distribution of s converges when A'^ — > oo to 



^(s|i/) = (5 s 
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»J=1 
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(13) 



As the only dependence over T is through i/, the average 
over r can be replaced by an average over v. The limit 
distribution ^(s) is thus obtained by averaging ^(sji') 
over the asymptotic {N — )• oo) distribution of v^ denoted 
as Q{v): 



*(s) = / n dv,, Q{v) si 



i,j = l 



D 



(14) 



The law of large numbers holds either when all rriij^s 
are equal (or at least those associated to nonzero Vij), 
or when the empirical frequencies Vij converge to non- 
random values Vij in the limit N ^^ oo, in which case 
the rescaled sum s converges to the deterministic limit 



m 



T.^, V 
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When the law of large numbers is sat- 
isfied, the validity of the central limit theorem can then 
be investigated. To this aim, we use the variable z de- 
fined in Eq. (|9]), and follow a similar path as above. For 
a given chain F, and thus a given v, one has 



Sr- Nm 



N 
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X 



(y) 



- Nvi^rriij 



V^ 



As for fixed (z,j), the variables Xj^''' 



(15) 
are i.i.d. with fi- 
nite variance, the central limit theorem applies to the 
sum between brackets, and the distribution of this sum 
converges, for A'^ — ;> cxd, to a centered Gaussian distribu- 
tion of variance aij . Hence the conditional distribution 
^(zlz^) is a centered Gaussian distribution of variance 
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Vij a^j . 



Averaging over z/ yields the distribution 



$(z)=/i}M^!dl^e-^V(.E.,..4). (16) 



27rE»j^u^ 



The central limit theorem then holds either when all aij 's 
(associated to nonzero Vij) are equal, or when the Vij^s 
take deterministic values. When the central limit theo- 
rem does not hold, the distribution ^{z) is a mixture of 
Gaussian distributions. 

As seen above, the distribution Q{i') is thus the key in- 
gredient to characterize the limit distributions ^(s) and 
(when the law of large numbers holds) ^{z). In the fol- 
lowing, we determine Qii') in representative cases, yield- 
ing typical examples of limit distributions. 

Irreducible matrix £.- Let us first consider the case 
where the matrix £ is irreducible. To check this property. 



given that £ is nonnegative, one can consider the asso- 
ciated directed graph: introducing D nodes, a directed 
link is defined from node i to node j if £ij > 0. The 
matrix £ is said to be irreducible if the graph is strongly 
connected (which means that for any («, j), there exists a 
directed path from i to j). If £ is irreducible and aperi- 
odic 31], the Perron- Frobenius theorem [27| implies that 
£ admits a dominant positive eigenvalue Ai, namely all 
other eigenvalues Ai are such that |Ai| < Ai. The eigen- 
vector ^l associated to Ai has positive components, and 
can be normalized such that X^jMI 1- Apart from 

boundary effects appearing when N — k <ti N , the tran- 
sition probability of the Hidden Markov Chain becomes 
homogeneous, that is, independent of k and of the final 
state /, 



piTk+i = j\Tk =i)^£ 



M 



(1) 



AiMi 



(17) 



From standard results of homogeneous Markov Chain 
theory 27], the chain converges to a unique stationary 
state described by a probability qi — /i,- p\ to occupy 
state i, where p*^^' is the Ai-eigenvector of £^ . The 
convergence towards the stationary state implies that. 



for N 



converges (almost surely) to the av- 



,(1 



(1) 



erage value Fy — qi£ijfij ' /\i^\ ' . The empirical fre- 
quencies Vij thus do not fluctuate and both the law 
of large numbers and the central limit theorem hold. 
This can be seen formally by plugging the distribution 

<3(^) = \\.^,2^'K'^^3-^^i) mto Eqs. ^ and ^. In- 
terestingly, this result is independent of the form of the 
matrix A. Note also that the above result is consistent 
with the fact that correlations are short-ranged when the 
matrix £ has a single dominant eigenvalue. 

Diagonalizable reducible matrix £.- Nonstandard dis- 
tributions are expected to emerge when correlation ex- 
tends over system size. As mentioned above, this is true 
when the multiplicity of the largest eigenvalue of the ma- 
trix £ is strictly larger than 1, which implies that the 
matrix is reducible. A simple example of such a situa- 
tion is when the multiplicity is equal to D. Assuming £ 
to be diagonalizable, it is then clear that £ is proportional 
to the identity matrix /. The proportionality factor can 
be scaled out by a redefinition of TZ{x) and thus of £ 
[see Eq. ([J)], so that we set £ — I. From Eq. (O, one 
sees that the only possible chains are the constant chains 
F^') = (i, j, . . . , i), for i = 1, . . . D. For a given chain F^'^ 
one thus has vu = 1 and Vki — for {k,l) ^ {i,i)- In 
addition, Eq. ^ implies that the chain F*^*) appears with 
probability qi = An/ ^ -^jj- As a result, one finds 



1 ^ 






, (18) 



which, from Eq. ([Ti)) , leads to a generalization of the law 



of large numbers, 

1 ^ 



(19) 



If all the ma 's are equal, the standard law of large num- 
bers holds, and using Eq. ([TS]). the central limit theorem 
is generalized into 
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Nondiagonalizable reducible matrix £.- Remaining in 
the framework of reducible matrices £ with a multiple 
dominant eigenvalue (so that correlation is generically 
long-ranged), we turn to the nondiagonalizable case. A 
simple and representative example of this case consists 
in considering a Jordan block. 



S ^ XI + U, U ^ 
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(21) 



Through both a change of basis and a rescaling of £, one 
can set A = 1. From Eq. ([5]), chains F having nonzero 
probability contain only transitions from i to i + I. If 
Ain y^ 0, it can be shown that with probability 1, F 
starts from Fi = 1 and end at Fat+i = D, in the limit 
N —i' oo. Moreover, Eq. ([5]) implies that all such chains 
are equiprobable. Given that for i ^ j, these chains have 
at most one transition from i to j, the corresponding em- 
pirical frequencies Vij converge to zero for N ^ oo. Only 
the frequencies vu are nonzero, and the equiprobability of 
chains implies that they are uniformly distributed, under 
the constraint ^^ vu — 1. Hence, at odds with previous 
cases, empirical frequencies have a continuous distribu- 
tion in the limit N ^ oo. 






(22) 



From Eq. (|14p , the generalized law of large numbers reads 

(23) 
The distribution ^(s) is piecewise polynomial, with sup- 
port [min{mii}, max{TOii}] (see below for an explicit ex- 
ample). If all the TOii's are equal, the standard law of 
large numbers holds, and a generalized form of the cen- 
tral limit theorem is obtained from Eq. (J16p as 
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FIG. 1: Limit distribution $(z) generalizing the Gaussian 
distribution in the case of two-dimensional matrices -see 
Eq. (|26p . for different ratios awjcrii- Parameters are cho- 
sen such that all distributions have variance 1. The Gaussian 
distribution is recovered for an — 022- Inset: same data on 
a semi-logarithmic scale. 



More generally, for an arbitrary matrix £, an analysis 
can be performed in terms of decomposition of £ into 
irreducible blocks. Ergodicity is then achieved within a 
given block, and irreversible transitions (similar to the 
transitions in the simple nondiagonalizable case above) 
take place. The final result for the distribution ^(s) (or 
when applicable, for $(2;)) is generically a complicated 
mixture, both continuous and discrete, of standard laws. 
Details will be given in a forthcoming publication I25|. 

Two-dimensional matrix £.- To give a more explicit 
example, we now consider the case of a two-dimensional 
matrix £. Nonstandard limit distributions emerge when 
correlation extends over system size, which for D = 2 
happens only if £ takes the form 



£ = 



X 7 
A 



A > 0, 7>0, 



(25) 



or its transpose, li j = 0, £ — XI, and the limit distri- 
butions are given by Eqs. HH) and ([201) with D ^ 2. If 
7 > 0, £ is nondiagonalizable and the limit distributions 
are given by Eqs. ((23|) and (|24p. which take a relatively 
simple form for D — 2. Indeed, ^(s) is simply a uniform 
distribution on the interval [min{TO.ii}, max{TOii}], and if 
mil = TO22, 



H^) = \l- 



da 



< 



^-^-/2a^ 



(26) 



This distribution is illustrated in Fig. [T]for different val- 
ues of cr22/o'ii, keeping the variance of $(2) fixed to 1. 
Increasing a-iij an makes the central peak of the distribu- 
tion sharper, while the tails remain essentially Gaussian. 
Discussion.- It is interesting to try to relate the above 
results to the behavior of some statistical physics mod- 
els. For a two-dimensional nondiagonalizable matrix. 



we have seen above that the chain F takes the form 
r = (1, . . . , 1, 2, . . . , 2), with a single transition from state 
1 to state 2. This property can be illustrated within 
the two-dimensional representation of the ASEP model 
[9|, l2ll | , in which Xk — or 1 characterizes the occupancy 
of site k. Although F cannot be directly observed, it 
can be interpreted as the origin of the front separating 
high and low density regions on the coexistence line of 
the ASEP model [29| . The fact that the position of the 
front is uniformly distributed over the system 29|, |30| pre- 
cisely reflects the random transition in the chain F, and 



results in a uniform distribution \l/(s) on [pniin, Pmax]- In- 
terestingly, this two-dimensional picture also seems qual- 
itatively valid when matrix representations of higher di- 
mensions (either finite or infinite) are needed, since in 
all cases, a single front is present. For finite dimensional 
representations, this property is understood from the fact 
that the Jordan block associated to the dominant eigen- 
value is two-dimensional 2l|, while the situation is less 



clear for infinite matrices. In any case, it would be in- 
teresting to try to extend the present approach to some 
relevant classes of infinite dimensional matrices. 
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